Kernels from Probabilistic Models for Multiclass Classification and Reranking with Arbitrary Loss Function

نویسندگان

Ivan Titov

James Henderson

چکیده

Many classification problems involve loss functions different from the usual zero-one classification loss. In recent years, several approaches to accommodate loss functions in kernel-based learning algorithms have been suggested, but the construction of kernels has not been motivated by specific loss functions. We propose a method for deriving kernels from probabilistic models, which is tailored to a given loss function. We evaluate this method empirically using a natural language statistical parser as the probabilistic model, a SVM with slack rescaling as the learning algorithm for reranking of candidate parses provided by the statistical parser, and F1 measure over bracketed constituents as the loss function. The method with the proposed kernel achieves a significant improvement in F1 measure over the results with a kernel motivated by a zero-one loss function (a TOP kernel generalization for reranking) and over results of the statistical parser alone.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Loss Minimization in Parse Reranking

We propose a general method for reranker construction which targets choosing the candidate with the least expected loss, rather than the most probable candidate. Different approaches to expected loss approximation are considered, including estimating from the probabilistic model used to generate the candidates, estimating from a discriminative model trained to rerank the candidates, and learnin...

متن کامل

ارائه یک مدل احتمالاتی برای توزیع خوردگی یکنواخت در سکوهای ثابت فلزی در خلیج فارس

For structural reliability assessment or risk analysis of aging offshore steel structures, it is essential to have a probabilistic model, which contains specific statistical parameters, and predicts long term corrosion loss as a function of time. The aim of this study is to propose such model for offshore jacket platforms in the Persian Gulf. Field measurements for material loss due to uniform ...

متن کامل

Data-Defined Kernels for Parse Reranking Derived from Probabilistic Models

Previous research applying kernel methods to natural language parsing have focussed on proposing kernels over parse trees, which are hand-crafted based on domain knowledge and computational considerations. In this paper we propose a method for defining kernels in terms of a probabilistic model of parsing. This model is then trained, so that the parameters of the probabilistic model reflect the ...

متن کامل

MORD: Multi-class Classifier for Ordinal Regression

We show that classification rules used in ordinal regression are equivalent to a certain class of linear multi-class classifiers. This observation not only allows to design new learning algorithms for ordinal regression using existing methods for multi-class classification but it also allows to derive new models for ordinal regression. For example, one can convert learning of ordinal classifier...

متن کامل

Deriving Kernels from MLP Probability Estimators

In multi-class categorization problems with a very large or unbounded number of classes, it is often not computationally feasible to train and/or test a kernel-based classifier. One solution is to use a fast computation to pre-select a subset of the classes for reranking with a kernel method, but even then tractability can be a problem. We investigate using trained multilayer perceptron probabi...

متن کامل

ذخیره در منابع من

ذخیره در منابع من قبلا به منابع من ذحیره شده

{@ msg_add @}

با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره شماره

صفحات -

تاریخ انتشار 2005

Kernels from Probabilistic Models for Multiclass Classification and Reranking with Arbitrary Loss Function

نویسندگان

چکیده

منابع مشابه

Loss Minimization in Parse Reranking

ارائه یک مدل احتمالاتی برای توزیع خوردگی یکنواخت در سکوهای ثابت فلزی در خلیج فارس

Data-Defined Kernels for Parse Reranking Derived from Probabilistic Models

MORD: Multi-class Classifier for Ordinal Regression

Deriving Kernels from MLP Probability Estimators

عنوان ژورنال:

اشتراک گذاری